
Conversation


@FabianHofmann FabianHofmann commented Sep 8, 2025

  • Remove pandas-based LP writing functions and replace with polars versions
  • Rename polars functions to remove '_polars' suffix for consistent API
  • Create separate get_printers_scalar() for non-LP functions (highspy, gurobi, mosek)
  • Update get_printers() to handle polars dataframes for LP writing
  • Consolidate "lp" and "lp-polars" io_api options to use same implementation
  • Remove unused imports and cleanup handle_batch function

Closes # (if applicable).

Changes proposed in this Pull Request

Checklist

  • Code changes are sufficiently documented; i.e. new functions contain docstrings and further explanations may be given in doc.
  • Unit tests for new features were added (if applicable).
  • A note for the release notes doc/release_notes.rst of the upcoming release is included.
  • I consent to the release of this PR's code under the MIT license.


FabianHofmann commented Sep 8, 2025

The replacement comes with a small trade-off for smaller problems, where the pandas-based IO is still a bit faster. However, for larger problems polars speeds things up significantly. Any opinions @coroa @lkstrp?

(note that the line of the crossover point in the upper right is a bit off, but you get the message)
[figure: runtime benchmark, pandas vs. polars LP writing over problem size]

@FabianHofmann

Also tagging @fneum.

@FabianHofmann

Thinking about it, I would say we go for it. We are talking about at most ~7 ms slower in bad configurations (tiny problems) but minutes faster for large problems, and the code gets streamlined.


@lkstrp lkstrp left a comment


> Thinking about it, I would say we go for it. We are talking about at most ~7 ms slower in bad configurations (tiny problems) but minutes faster for large problems, and the code gets streamlined.

Agreed! Also the cleanup is nice

The polars migration broke NaN validation because `check_has_nulls_polars`
only checked for null values, not NaN values. In polars, these are distinct
concepts. This fix enhances the validation to detect both null and NaN values
in numeric columns while avoiding type errors on non-numeric columns.

Fixes failing tests in `test_inconsistency_checks.py` that expected a ValueError
to be raised when variables have NaN bounds.

fneum commented Sep 8, 2025

Agreed, but we should also run a memory comparison (e.g. with a PyPSA-EUR case).

@FabianHofmann

> Agreed, but we should also run a memory comparison (e.g. with a PyPSA-EUR case).

Good point, but no need to worry: we have the slicing logic, which ensures that memory requirements stay bounded.

@FabianHofmann FabianHofmann merged commit 2d69959 into master Sep 8, 2025
21 checks passed
@FabianHofmann FabianHofmann deleted the remove-pandas-lp-io branch September 8, 2025 12:14

fneum commented Sep 8, 2025

Maybe someone can do it nevertheless? I am not sure anyone has ever tested the polars writing implementation on a PyPSA-Eur-type large problem (even if it uses the same slicing logic).


coroa commented Sep 8, 2025

Does anyone know whether the polars code here stays within the confines of the narwhals compat layer (which is a subset of the full polars API)? Then switching between pandas and polars could easily be made a config option, and atlite would not have to depend on polars; it could remain optional.

@coroa coroa mentioned this pull request Sep 9, 2025

FBumann commented Nov 3, 2025

@FabianHofmann @coroa Great work.
I am seeing a ~73 % decrease in time spent in `model.to_file()` (393 s → 107 s), with `model.shape = (24_558_097, 14_991_445)`.


lkstrp commented Nov 3, 2025

The PR was breaking, though, and the release should have been 0.6.0 anyway. The two `check_has_nulls_polars` calls in `to_polars` raise if a constraint contains NaNs. This is the case for multi-investment periods in PyPSA and KVL. I'm not even sure what the check is doing there.

Also, `filter_nulls`

```python
short = filter_nulls_polars(short)
```

only filters out "empty" constraint terms (label is -1 or coefficient is 0), not actual NaNs, while the line below does not just handle "empty" constraints but actually raises on NaNs:

```python
check_has_nulls_polars(short, name=f"{self.type} {self.name}")
```

@FabianHofmann Could you point me to a fix?
Should this have been dealt with already? Should `filter_nulls_polars` remove them? What is the purpose of `check_has_nulls_polars` anyway? I don't encounter any errors if I simply remove it. We can surely make the naming clearer.
